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ABSTRACT 

Proposed cosmological surveys will make use of photometric redshifts of galax- 
ies that are significantly fainter than any complete spectroscopic redshift surveys 
that exist to train the photo-z methods. We investigate the photo-z biases that 
result from known differences between the faint and bright populations: a rise 
in AGN activity toward higher redshift, and a metallicity difference between in- 
trinsically luminous and faint early-type galaxies. We find that even very small 
mismatches between the mean photometric target and the training set can induce 
photo-z biases large enough to corrupt derived cosmological parameters signifi- 
cantly. A metallicity shift of ~ 0.003 dex in an old population, or contamination 
of any galaxy spectrum with ~ 0.2% AGN flux, is sufficient to induce a 10~ 3 
bias in photo-z. These results highlight the danger in extrapolating the behavior 
of bright galaxies to a fainter population, and the desirability of a spectroscopic 
training set that spans all of the characteristics of the photo-z targets, i.e. ex- 
tending to the 25th mag or fainter galaxies that will be used in future surveys. 

Subject headings: Data Analysis and Techniques, Galaxies 



1. Introduction 

Photometric redshifts have been used extensively in studies of galaxy evolution, par- 
ticularly for sources that are too faint for feasible spectroscopic detection of absorption or 
emission lines. Photo-z's are just beginning to find application to measurements of cosmic 
structure, particularly in the determination of source redshifts for weak gravitational lensing 
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Yet the cosmological measures that weak-lensing surveys have as goals are extremely 
sensitive to biases or other miscalibrations of the photo-z scale: biases as small as ~ 10~ 3 (1 + 



z) w ill overwhelm the statistical errors on cosmology in these experiments (IHuterer et al. 



20061 ). The photo-z technique has never been subject to such demands on its accuracy so 
we need to examine carefully the strategies for achieving them. 

A photo-z algorithm must somehow be taught to convert broadband fluxes into redshifts. 
This can be done by fitting to theoretical or empirical templates for galaxy spectra (e.g. 
Benftezl 120001) ; or by co mpletely empirical machine- learning techniques, e.g. neural nets 
(jCollister &: Lahavll2004l ). that are trained with a sample of galaxies of known spectroscopic 
redshift. There is a difficulty, however, in that many imaging surveys intend to measure 
photo-z's from galaxies that are too faint to obtain the spectroscopic information needed to 
form templates or obtain spectroscopic training redshifts. One must therefore take on some 
degree of faith the assertion that a photo-z algorithm trained on brighter galaxies will return 
sufficiently unbiased redshifts for galaxies beyond the spectroscopic limit. 

We investigate in this paper the sizes of biases that we might expect to arise because of 
differences between the faint galaxy population and the brighter spectroscopic population. 
In particular: 



• Fainter galaxies will tend to be at higher redshifts, where active galactic nuclei (AGN) 
are more frequent or brighter than at low redshift. Furthermore, optically-based spec- 
troscopy is inefficient in the "desert" between 1.5 < z < 3, where quasar activity peaks. 
We can ask, therefore, whether the different AGN characteristics of the fainter galaxies 
could bias the photo-z's trained on low-z galaxies. 

• For elliptical galaxies, there is a well-known color-magnitude relation. As we move 
to lower luminosity galaxies in this red sequence, stellar populations traverse a path 
in the age-metallicity plane. Stellar populations of the luminous galaxies of different 
Hubble types that would typically be used for photo-z training will not cover the 
same locus in the age-metallicity plane. Would photo-z's trained on bright galaxies 
produce biased redshifts for fainter ellipticals? We investigate this by examining the 
metallicity dependence of photo-z results for old stellar populations when training 
on the bright Hubble sequence. There are additional complications from the age- 
metallicity degeneracy. Accordingly, we also estimate the age dependent behavior of 
photo-z biases. 
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In the Rumsfeld parlance, these are "known unknowns" of the faint galaxy population: 
a photo-z algorithm might be designed which could compensate for these two differences 
between faint and bright galaxies. But we will use a photo-z algorithm that is ignorant of 
these effects, so that they can serve as models for "unknown unknowns" which might bias 
faint photo-z estimates. In this way we might learn how safe or dangerous it is to extrapolate 
training sets or templates to fainter magnitudes. 



2. Photo-z fitting technique and template sets 



Probing AGN contamination biases necessitates the preparation of an appropriate galaxy 
spectrum and the use of fitting software to determine the photometric redshift z p . Contam- 
inated galaxies are modeled by introducing a QSO spectrum as an impurity to the spectral 
energy distributions (SEDs) of galaxies. The Le Phare^ software package is used to compute 
the photometric redshifts. Le Phare compares theoretica l and known source magnitudes, 
estimating z p via \ 2 minimization (cf. lArnouts et a/.lll999l . 120021 ). 



Le Phare fits to a magnitu de catalog of template galaxies. I n this case, the catalog 



consists of 4 k nown galaxies from Coleman. Wu. fc Weedmanl ( )1980l )[CWW] and 3 simulated 
galaxies from iBruzual fc Charlotl (119961 ) [BC96] as well as linear interpolations (via flux) of 
these. The CWW galaxies are early types, Sbc and Scd spirals, and irregulars, while the 
BC96 galaxies are 2, 0.5, and .05 Gyr in age. Altogether, including interpolations, the catalog 
is composed of 72 models. We investigate two sets of filters. The first includes 4 Hubble Deep 
Field North (HDFN) filters (F300W, F450W, F606W, and F814W) and three near-infrared 
filters (Jbb, H, and K) included in the Le Phare software package. The second set is the 5 
Sloan Digital Sky Survey (SDSS) filter set (u, g, r, i, and z). Le Phare compiles this catalog 
by calculating the magnitudes for each galaxy SED with each filter at redshifts from to 6 
in increments of .04. Note that the template galaxies contain no AGN signal. 

A second input catalog describes the galaxies for which z p is to be estimated. We syn- 
thesize AGN-contaminated magnitudes by: making a QSO template spectrum; normalizing 
this template to the galaxy SEDs; contaminating the Le Phare templates with a specified 
fraction of AGN flux; then calculating apparent magnitudes at various redshifts and AGN 
cont aminat ions . 

A template QSO spectrum over a wide wavelength s pan i s needed in order to synthe- 
size AGN contaminated SEDs. IVanden Berk et all ( 1200 ll ) and iGlikman. Helfand. &: White 



^ttp : //www . oamp . f r/people/arnouts/LE_PHARE . html 



-4- 



( 120061 ) provide suitable QSO template spectra; the former covers a broad UV-NIR (800.5A- 
8554. 5A) wavelength range while the latter extends further into the NIR (580lA- 35095. 4A). 
We scale the Glikman spectrum to match the Vanden Berk spectrum at 847 nm, where both 
are well measured, and then concatenate the two. 

The QSO contamination is specified by the fraction < h < 0.2 of the flux contributed 
by the QSO. Specifically, for a chosen galaxy template g(X) and our template QSO spectrum 
q(X) we first determine the normalization iV which yields 

g(X)d\ = N / q(X)dX, (1) 
A2 J \\ 

where the integration bounds Ai = 80 nm to A2 = 3.5 /xm span the range over which both 
spectra are known. For a desired contamination h, the galaxy rest-frame SED is taken as 

f(X) = (l-h)g(X) + Nhq(X). (2) 

Apparent magnitudes in the chosen filter sets are calculated using the standard formulae for 
redshifted sources. 

We use the same QSO template spectrum for all levels of contamination. In reality, 
low-luminosity AGN will have different spectra than bright QSOs, e.g. weaker broad lines, 
however the effect of contamination on photo-z's that we derive will still be crudely correct. 

The catalog of AGN-contaminated galaxies contains 7 galaxy types — the 4 CWW tem- 
plates (of type E, Sbc, Scd, and Irr) and the 3 BC96 simulated star forming galaxies (denoted 
12, 105, and 1005 for their ages) — at redshifts in the range < z < 3 in steps of .1 for SDSS 
filters, and in steps of .1 from < z < 1 and steps of .2 for 1 < z < 3 for the HDFN filters, 
with AGN contaminations in the range < h < 0.2 in 0.01 increments. 

We run Le Phare to obtain estimates for z p . The quantity of interest is the redshift bias 
between the nominal and photometric redshifts, to wit, Az = z p — z. 



3. Bias from AGN contamination 

All galaxy types using either filter set have a non-trivial bias Az induced by AGN 
contamination. We first examine how Az depends on contamination fraction h, at fixed z 
and galaxy type. Figure [T] illustrates the range of behaviors found. The 12 galaxy shows a 
simple linear bias-contamination response with a well-defined d(Az)/dh. In cases like the 
Scd galaxy at z — 0.7, the linear trend gains steps or wiggles, which we believe is induced by 
the discrete steps in galaxy type that Le Phare takes when finding a best-fit template. In this 
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Fig. 1. — Plots of Az vs. h with various behaviors. 



case we define d(Az) /dh by a least-squares fit through all the data points. Finally there are 
cases like the high redshift E-type in which the best-fit photo-z becomes "catastrophically" 
incorrect above some contamination level. Here the AGN light has moved the galaxy toward 
some new part of the color manifold of the templates rather than inducing a small differential 
change. We define d(Az) / dh by manually selecting a region of h that is below the catastrophe 
and fitting a slope to this region. We essentially ignore the catastrophic failures because we 
are more interested in biases induced by gradual evolution in the AGN fraction. We will 
focus henceforth on the slope d(Az)/dh in the vicinity of h = 0. 




i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i_ 

1 2 3 1 2 3 

Redshift (z) Redshift (z) 



Fig. 2. — The response d{Az)/dh of photo-z bias to AGN contamination is shown for two filter sets 
(HDFN on the left and SDSS on the right). The bias response is plotted vs redshift for different 
galaxy types. For clarity, in the plot we require \d(Az)/dh\ < 2.5 which cuts out some catastrophic 
errors. 
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Figure |2] shows d(Az) jdh from the low-contamination domain in both filter sets. The 
different templates show a variety of behaviors, and some are erratic due to discreteness 
in the Le Phare fitting, particularly with the SDSS filter set, which offers poor photo-z 
constraints in some domains due to lack of NIR coverage. 

The important point, however, is that the bias responses d(Az)/dh are ~ ±0.5 over most 
of the color and galaxy- type domain. A 1% contamination with AGN light causes a photo-z 
bias of Az ~ 0.005 if the photo-z algorithm is using uncontaminated templates. Application 
of photo-z's to cosmic-shear measurements require that biases be held below ~ 0.001, which 
means that the mean AGN contamination of the target sample would need to be determined 
to 1% in order for the resultant photo-z biases to make insignificant impact on the inferred 
cosmology. We will not know the faint-galaxy AGN fraction to anywhere near this precision, 
so this would be a major problem. 

Of course we are using a naive photo-z estimator: one could includ e a QSO templat e 



in the fitting process (as in the option Le Phare offers, or as described in lWolf et al\ (120011 ). 
which includes object classification), or attempt to construct an empirical training set of 
bright galaxies that spans the range of AGN contamination of the faint set. This would 
greatly reduce the photo-z bias in our simple simulated catalog. The behavior of real AGN 
is, however, much more complex than our simple single QSO template, and it is likely that 
the faint population has different AGN spectra than the bright galaxies that might serve as 
templates. We simply point out that photo-z biases can be very sensitive to AGN light, and 
it is risky to extrapolate AGN characteristics to a new population. 



4. Age-Metallicity Models 

We use stellar population models to estimate the redshift bias induced by a metallic- 
ity mismatch between target galaxies and Le Phare's templates (Le Phare uses the same 
extended CWW catalog used above). The galaxy evolution simulation code GALAXEV@ 
is used here to generate a magnitude catalog for galaxies with near-so lar metallicity with 



a z = ages of 10 Gyr. We use the standard model as described in iBruzual fc Chariot 



( 120031 ) [BC03], Specifically, the galaxy models employed here use the STELIB and BaSeL 
spectral libraries, a simple stellar population (SSP) star formation rate, the Chabrier initial 
mass function, and the 1994 Padova evolutionary tracks (per recommendation of the code's 
authors) . 



http : //www2 . iap . f r/users/charlot/bc2003/ 
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GALAXEV produces a table of magnitudes and ages at various redshifts for a given 
galaxy model and filter set. The same SDSS and infrared supplemented HDFN filters from 
the AGN contamination study are used to calculate these magnitudes. GALAXEV produces 
colors for metallicities Z/Z & = 0.2,0.4,1.0, and 2.5 (models m42, m52, m62, and m72 in 
GALAXEV). We also produce colors for metallicities 0.3, 0.7, and 1.75Z by interpolating 
between these four GALAXEV spectra. 

We further employ GALAXEV to investigate biases due to age misestimates. We gen- 
erate solar metallicity models with z = ages of 11, 9, and 8 Gyr. Note that the fiducial 
model here is still the 10 Gyr old solar metallicity galaxy, but in contrast to what is done in 
the metallicity study, we now hold the metallicity fixed and vary the age. 



5. Bias From Metallicity Indeterminacy 

Calculating the bias due to a small metallicity mismatch requires removing the fidu- 
cial bias present resulting from inherent differences between the GALAXEV spectra and 
Le Phare's catalog of templates. At each redshift, we take an elliptical galaxy with solar 
metallicity as our benchmark. The solar bias, 5z & , is defined as the difference between the 
solar model's redshift and Le Phare's estimate: 5z & = % )mod — %, phot . For any particular 
metallicity, the bias due to metallicity mismatch is then Az = z phot — z mo d c i — <5z Q . Hence 
by definition we assign zero metallicity bias to a solar-metallicity population, for any chosen 
filter set, population age, and redshift. 

Figure [3] illustrates the biases from metallicity offsets as a function of redshift between 
z = and z = .7. Five models are present in the plot, each is 10 Gyr old at z = with 
metallicities 0.2-2.5Z Q . As the models get more metal rich, the redshift bias increases, that 
is, the photometric redshift gets larger than the model's redshift. In the redshift range that 
was studied, the super- and sub-solar models both had biases of up to ±.2 for metallicities 
~ ±0.5 dex away from solar. This suggests that biases of ~ 0.001(1 + z) would result 
from shifts of only ~ 0.003 dex in the mean metallicity of an old population away from 
the metallicity of the training set. For comparison, current modelling of age and metallicity 
variation along the red sequence of SDSS ga laxies suggests a ~ 0. 3 dex change in metallicity 



across 4 magnitudes of galaxy luminosity ( jBernardi et all 120051 ) . Therefore the biases in 



photo-z's of faint red-sequence galaxies can be highly significant if the photo-z's are trained 
exclusively on brighter red-sequence members. 
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6. Bias from Age Indeterminacy 



Biases from age estimates errors are computed in the same manner as in the metallicity 
case. The intrinsic bias from the benchmark lOGyr solar metallicity galaxy, fcioGyn must 
be subtracted from the total, Sz m0( ^, to arrive at the bias attributable to age mismatching: 
A^age = ^mod ~ z p ~ %0Gyr = <^mod ~ ^lOGyr- 

Figure 0] portrays the induced biases in both filter sets. Compared to the metallicity- 
induced biases, these show little structure outside of the feature near z = A for the SDSS 
filters (also found in the metallicity plots), perhaps suggesting that the photo-z's are more 
sensitive to metallicity offsets. For the HDFN filter set, the bias only becomes comparable 
at higher z when the age offset is 2 Gyr. In both studies, the HDFN sets exhibit a slightly 
wider range in biases which may mean that extra bands may be more confusing for Le Phare 
if the templates and data are not initially well matched. 




Fig. 3. — Metallicity-induced biases are shown from rest-frame to z=.7 for HDFN (left) and SDSS 
(right) filters. Each model's metallicity is shown on the plot in solar units. The dashed and dotted 
lines serve distinguish between the models in the crowded regions. The smoother behavior of the 
HDFN+IR plot is attributable to the supplementary IR bands — the HDFN data acquire jagged 
profiles when the analysis is done sans the IR bands. Indeed, these plots (not shown) resemble the 
SDSS plots; while the overall range of the bias decreases, the curves behave more erratically. 




Fig. 4. — Shown above are the age-induced biases for solar metallicity models with < z < .7 
for the HDFN (left) and SDSS (right) filter sets. Model ages are distinguished by color. As with 
the metallicity-induced bias, a reanalysis of the HDFN data with NIR data excluded yields plots 
comparable to the SDSS. 



7. Conclusion 



Photometric redshifts resemble the real redshifts only as closely as the training sets 
mimic the objects of interest. In the cases of AGN activity and metallicity variation, we 
find that even very small mismatches between the mean photometric target and the training 
set can induce photo-z biases large enough to corrupt significantly the validity of derived 
cosmological parameters. In particular, we find that a metallicity shift of ~ 0.003 dex in an 
old population, or contamination of any galaxy spectrum with ~ 0.2% AGN flux, is sufficient 
to induce a 10 -3 bias in photo-z. Miscalculating the age of a galaxy by 2 Gyr may also result 
in similar a bias. In a real survey, we can expect differences far larger than this between 
the faint photometry survey targets and the bright spectroscopic targets used for photo-z 
training. 

While simple models are used here to study redshift biases, the results indicate a need 
to develop training sets that encompass the full range of behavior of the photo-z target 
population. Our study uses a worst-case situation in that the photo-z algorithm is given 
no information on the physical effects underlying the bias (AGN or a luminosity-metallicity 
correlation). In reality the training sets can include galaxies with AGN of varying brightness 
and some galaxies of lower luminosity, which would ameliorate the biases we have found. 
However the extreme sensitivity of the photo-z to these effects suggests that it will be dan- 
gerous to extrapolate from the behavior of bright training-set galaxies to a target photo-z 
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sample that extends to significantly higher redshift and lower luminosity, since even subtle 
differences in spectral behavior can lead to important biases. Furthermore we have inves- 
tigated two known differences between the faint and bright populations, but there may be 
differences that have not yet been discovered because there are no detailed spectroscopic 
surveys of the galaxies fainter than 24th magnitude that will comprise the bulk of future 
weak-lensing surveys. These results highlight the desirability of training photo-z's with com- 
plete spectroscopic surveys to ~25 mag, so that the training set is representative of the 
galaxies for which photo-z's are being obtained. 

This work is supported by grant AST-0607667 from the NSF and DOE grant DE- 
FG02-95ER40893. We are very grateful to Mariangela Bernardi and Gordon Richards for 
their advice and assistance. We also thank Stephanie Jouvel for instruction in the use of 
Le Phare. 
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